ISciences Open Science Portfolio

Advancing Earth Science Research Through Open Tools and Education

Joshua Brinks

Research Scientist, ISciences, LLC
jbrinks@isciences.com

Presentation Outline

  • ISciences Overview
  • Highlight several open science efforts
  • Lessons learned and closing thoughts

ISciences

Who We Are

  • FOSS Focused Research and technology firm based in Burlington, VT
  • Specializing in geospatial analytics and applied statistical modeling
  • Focus on water resources, political instability environmental monitoring, and risk assessment
  • Small team of scientists, engineers, and developers

What We Do

  • Partner with agencies including NASA, NOAA, USACE, intelligence
  • Develop water security monitoring systems (WSIM)
  • Develop open source software, packages, and educational materials
  • Support decision-making through data-driven insights

Broader Benefits

  • Roadmaps for access to data and tools
  • Reduce barriers for learning these tool
  • Enable participation from diverse stakeholders
  • Solicit feeback from under-resourced communities
  • Builds trust through transparency

DANTE

Data Analysis Toolbox for Environmental Research

Overview

  • Collaborative open-science platform developed by USACE, ISciences, Columbia’s CIESIN, and CASE International
  • Accelerates multidisciplinary environment-security applied research
  • Bridges political science, social science, geography, environmental sciences, and defense communities

DANTE Project Website | GitLab Project

TOPS-SCHOOL

ScienceCore Heuristics for Open Science Outcomes in Learning

Overview
  • NASA-funded Open Science training curriculum
  • Joint effort between CIESIN, ISciences, CUNY
  • Emphasizes hands-on learning with real NASA datasets
  • Cloud-based learning environment/workshops (2i2c)
  • Case studies using recent environmental events

Water Module | Air Quality Module | Disasters Module | Climate and Ag. Module

TOPS-SCHOOL Case Studies

SCALAR

ScienceCore Curriculum for AI Literacy and Research

Overview

  • Proposed NASA HPOSS-funded follow-on to TOPS-SCHOOL
  • Addresses AI literacy challenges in Earth Sciences
  • Teaches responsible integration of LLMs with NASA data
  • Focuses on recent missions: TEMPO, ECOSTRESS, SPORT-LIS, SWOT, PACE
  • Status: Seeking funding and institutional home

Learning Outcomes

  • Access NASA Earth Science datasets with AI assistance
  • Design effective prompts for scientific contexts
  • Recognize and mitigate AI hallucinations and bias
  • Validate AI-generated code and analyses
  • Apply open science principles to AI workflows

nClimGrid Explorer

Interactive Climate Data Visualization Tool

  • R package and Shiny web application for NOAA nClimGrid-Daily dataset
  • Funded by ESIP Lab grant program
  • Provides open-science tools for climate and social vulnerability analysis

nClimGrid Explorer

  • Lots of stakeholder possibilities
    • Political representatives
    • Land conservancies
    • Municipal climate support systems
    • EJ and vulnerable population support
  • Deciding where to go next after beta release

Gitlab Repo | Package Site | Shiny App | S3 Bucket | TOPS SCHOOL Lesson

exactextractr Package

High-Performance Zonal Statistics for R

Overview

  • Fast and accurate zonal statistics extraction from raster data
  • Orders of magnitude faster than competing methods (raster::extract, velox, ESRI, QGIS, GRASS, rasterio)
  • Implements precise algorithms for area-weighted statistics
  • Coverage fraction calculations for accurate area-weighted statistics

Main Website | R Package Website | Python Module Website

GDAL Contributions

About GDAL

  • Industry-standard FOSS geospatial data abstraction library
  • Powers countless GIS applications and tools worldwide
  • Supports 200+ raster and vector formats

Latest Contribution: gdal raster zonal-stats

  • New command in GDAL 3.12.0 “Chicoutimi” (upcoming release)
  • Brings exactextractr-style performance to native GDAL

Key Features

  • 20+ statistics: mean, sum, min/max, variance, weighted variants
  • Weighting support: population-weighted calculations
  • Pipeline integration: combine with other GDAL operations
  • Works with polygons or categorical rasters

Latest GDAL Release | gdal raster zonal stats

Water Security Indicator Model (WSIM)

Overview

  • Global water security monitoring and forecasting system
  • Operating continuously since April 2011
  • Monthly reports with 1-9 month lead-times

Current Status & Future

  • Open source code base
  • Monthly regional blogs posted on our site
  • Detailed reports available via ISciences
  • Vision: Open cloud-based version for public access

WSIM Website | GitLab Repository

WSIM

  • Return period methodology (3-40+ year events)
  • Quarter degree resolution
  • Five severity categories: abnormal to exceptional
  • Depicts both deficits (red) and surpluses (blue)
  • Purple shading indicates concurrent deficit/surplus
  • Data Source: ECMWF Reanalysis v5 (ERA5)
  • Baseline Period: 1981-2020 (satellite era focus)
  • Forecast Model: NOAA CFSv2 ensemble forecasts
  • Composite anomaly index based on soil moisture, ET deficit, runoff, and blue water

NIDIS / Drought.gov

  • We have multiple employees serving in various capacities of drought.gov
  • Awaiting go-ahead on an open science NIDIS grant addressing non-stationarity in long term climatologies.
    • Stakeholder advisory panel
    • Full cloud hosted open data and code
    • Integration with drought.gov forecasting and warning system
    • Previewing at AMS in January

Lessons Learned

  • Stakeholder engagement and design
  • Niche domains have proven difficult
  • Simplify the front end product
  • Not everyone is on board

Thank You

Acknowledgments

  • ESIP
  • NASA TOPS/Open Science
  • NASA ROSES
  • Open Science Team and SCHOOL contributors
  • Columbia University
    • Center for International Earth Science Information Network (CIESIN)
    • SEDAC
  • USACE Coastal Hydrology Lab
  • GDAL and FOSS contributors

All materials available under open licenses. Visit our repositories for code, data, and documentation.